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Abstract 

In this paper we prove lower bounds on randomized multiparty communication complexity, both in 
the blackboard model (where each message is written on a blackboard for all players to see) and (mainly) 
^^ , in the message-passing model, where messages are sent player-to-player We introduce a new technique 

Cn ' for proving such bounds, called symmetrization, which is natural, intuitive, and relatively easy to use in 

comparison to other techniques for proving such bounds such as the icost method |I5|. 

For example, for the problem where each of k players gets a bit-vector of length n, and the goal 

is to compute the coordinate-wise XOR of these vectors, we prove a tight lower bounds of Q,{nk) in 

the blackboard model. For the same problem with AND instead of XOR, we prove a lower bounds of 

roughly O(rifc) in the message-passing model (assuming k < n/3200) and 0(ri log k) in the blackboard 

model. We also prove lower bounds for bit-wise majority, for a graph-connectivity problem, and for 

other problems; the technique seems applicable to a wide range of other problems as well. The obtained 

\^ • communication lower bounds imply new lower bounds in the functional monitoring model ifTTl (also 

C/3 I called the distributed streaming model). All of our lower bounds allow randomized communication 

^, ' protocols with two-sided error. 

We also use the symmetrization technique to prove several direct-sum-hke results for multiparty 
communication. 
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1 Introduction 

In this work we consider multiparty communication complexity in the number-in-hand model. In this model, 
there are k players {pi , . . . , pk}, each with his own n-bit input Xi G {0, 1}". The players wish to collaborate 
in order to compute a joint function of their inputs, f{xi, . . . , x^). To do so, they are allowed to communi- 
cate, until one of them figures out the value of /(xi , . . . ,Xk) and returns it. All players are assumed to have 
unlimited computational power, so all we care about is the amount of communication used. There are three 
variants to this model, according to the mode of communication: 

1 . the blackboard model, where any message sent by a player is written on a blackboard visible to all 
players; 

2. the message-passing model, where a player pi sending a message specifies another player pj that will 
receive this message; 

3. the coordinator model, where there is an additional {k + l)-th player called the coordinator, who re- 
ceives no input. Players can only communicate with the coordinator, and not with each other directly. 

We will work in all of these, but will mostly concentrate on the message-passing model and the coordinator 
model. Note that the coordinator model is almost equivalent to the message-passing model, up to a log k 
multiplicative factor, since instead of player i sending message x to player j, player i can transmit message 
(j, x) to the coordinator, and the coordinator forwards it to player j. 

Lower bounds in the three models above are useful for proving lower bounds on the space usage of 
streaming algorithms, and for other models as well, as we explain in Section 11.21 Most previous lower 
bounds have been proved in the blackboard model, but lower bounds in the message-passing model and the 
coordinator model suffice for all the applications we have checked (again, see more in Section [Oi l. 

Note that another, entirely different, model for multiparty communication is the number-on-forehead 
model, where each player can see the inputs of all other players but not his own input. This model has 
important applications for circuit complexity (see e.g. |[23l ). We do not discuss this model at aU in this 
paper. 

We allow all protocols to be randomized, with public coins, i.e. all players have unlimited access to a 
common infinite string of independent random bits. We allow the protocol to return the wrong answer with 
probability e (which should usually be thought of as a small constant); here, the probability is taken over 
the sample space of public randomness. Note that the public coin model might seem overly powerful, but in 
this paper we are mainly interested in proving lower bounds rather than upper bounds, so giving the model 
such strength only makes our results stronger. 

For more on communication complexity, see the book of Kushilevitz and Nisan |[23l . and the references 
therein. We give some more definitions in the preliminaries, in Section |2l 

1.1 Warm-Up 

We begin by sketching two lower bounds obtained using our technique, both of them for the coordinate-wise 
XOR problem. These lower bounds can be proved without using symmetrization, but their proofs that use 
symmetrization are particularly appealing. 

First consider the following problem: Each player gets a bitvector Xj G {0, l}*^ and the goal is to compute 
the coordinate-wise XOR of these vectors. We operate in the blackboard model, where messages are posted 
on a blackboard for all to see. 

Theorem 1.1. The coordinate-wise XOR problem requires communication Vt{nk) in the blackboard model. 



To see this, first let us specify the hard distribution: we prove the lower bound when the input is drawn 
from this distribution, and by the easy direction of Yao's Minimax Lemma (see e.g. ll23l ). it follows that 
this lower bound applies for the problem as a whole. The hard distribution we choose is just the distribution 
where the inputs are independently drawn from the uniform distribution. 

To prove the lower bound. Consider a protocol P for this /c -player problem, which works on this distri- 
bution, communicates C{P) bits in expectation, and suppose for now that it never makes any errors (it will 
be easy to remove this assumption). We build from P a new protocol P' for a 2-player problem. In the 
2-player problem, suppose that Alice gets input x and Bob gets input y, where x, y G {0, 1}" are indepen- 
dent random bitvectors. Then P' works as follows: Alice and Bob randomly choose two distinct indices 
i,j E {1, . . . , k} using the public randomness, and they simulate the protocol P, where Alice plays player 
i and lets Xi = x, Bob plays player j and lets Xj = y, and they both play all of the rest of the players; 
the inputs of the rest of the players is chosen from shared randomness. Alice and Bob begin simulating 
the running of P. Every time player i should speak, Ahce sends to Bob the message that player i was 
supposed to write on the board, and vice versa. When any other player pr (r / i,j) should speak, both 
Alice and Bob know his input so they know what he should be writing on the board, thus no communication 
is actually needed (this is the key point of the symmetrization technique). A key observation is that the 
inputs of the k players are uniform and independent and thus entirely symmetrically and since the indices i 
and j were chosen uniformly at random, then the expected communication performed by the protocol P' is 
E[C(P')] = 2C{P)/k. Furthermore, when the players have finished simulating P' , Alice now knows the 
bit-wise XOR of all the vectors xi, . . . ,Xk, from which she can easily reconstruct the vector Xj (since she 
already knows all the other vectors x^ for r / j). It follows that using 2C{P)/k expected communication, 
Alice has managed to reconstruct Bob's entire input; from an easy information-theoretic argument it follows 
that 2C{P)/k > n, so C{P) > Q{nk), proving the theorem. 

Extending the above argument to cover the case where the protocol P is allowed to return the wrong 
answer with probability e is also easy, simply by showing that if Alice managed to learn Bob's entire bit- 
vector with probability 1 — e, then (1 — e) • n bits must have been communicated: this also follows easily 
from information- theoretic arguments. 

Note that crucial fact that the hard distribution we chose is symmetric: if the distribution of the inputs 
to the k players was not symmetric, then the protocol P could try to deduce the indices i and j from some 
statistical properties of the observed inputs, and act according to that. Then the best upper bound we could 
get on the communication complexity of P' would be C{P') < C{P), which is much too weak. 

We have just described the version of the symmetrization method for the blackboard model. A similar 
(slightly more complicated) line of thinking leads to a lower bound of ^{n log k) on the complexity of the 
coordinate- wise AND problem in the blackboard model; see Section im 

Let us now sketch the symmetrization method as it is used in the coordinator model, where players can 
only send and receive messages to and from the coordinator. We prove a lower bound on the same problem 
as above, the coordinate-wise XOR problem. 

Theorem 1.2. The coordinate-wise XOR problem requires communication Q{nk) in the coordinator model. 

Note that this theorem actually follows from Theorem 11.11 since the blackboard model is stronger than 
the coordinator model. However, the proof we sketch here is useful as a warmup exercise, since it shows an 
easy example of the symmetrization technique as applied to the coordinator model. In the paper we prove 
multiple lower bounds using this technique, most of which do not follow from corresponding bounds in the 
blackboard model. 



'Here and throughout the paper, symmetric means that the inputs are drawn from a distribution where renaming the players does 
not change the distribution. Namely, a distribution D over X" is called symmetric if exchanging any two coordinates in D keeps 
the distribution the same. 



To prove this theorem, we use the same hard distribution as above: all inputs are uniform and independent. 
Let P be a protocol in the coordinator model, which computes the coordinate-wise XOR, uses communica- 
tion C{P) and for now assume that P never makes any en^ors. As before, we build a new protocol P' for 
a 2-player problem. In the 2-player problem, suppose that Alice gets input x and Bob gets input y, where 
re, y € {0, 1}" are independent random bitvectors. Then P' works as follows: Ahce and Bob choose a 
single index i G {1, . . . ,k} from public randomness. Alice and Bob simulate the protocol P, where Alice 
simulates player i and lets Xi = x, and Bob plays all the rest of the players, including the coordinator, and 
chooses their inputs uniformly, conditioned on their XOR being equal to y. 

To simulate the protocol P, whenever player i needs to send a message to the coordinator, then Alice 
sends a message to Bob, and whenever the coordinator needs to send a message to player i, Bob sends 
a message to Alice. Whenever any player j ^ i needs to speak to the coordinator, no communication 
is needed, since both are played by Bob. Note again that the distribution of the inputs of the k players 
is independent uniform (for this it is crucial to remember that x and y were uniform and independent in 
the first place). Once again, from reasons of symmetry, since the index i was chosen uniformly and the 
inputs are symmetric, the expected communication performed by the protocol P' is E[C(P')] < 2C{P)/k. 
Furthermore, at the end of the running of P' , Alice knows the value of x © ?/ so she can reconstruct the 
value of y. As before, this implies the theorem. The assumption that we never make errors can once again 
be easily removed. 

1.1.1 Discussion 

We see that the crux of the symmetrization technique in the coordinator model is to consider the A; -player 
problem that we wish to lower-bound, to find a symmetric distribution which seems hard for it, to give Alice 
the input of one player (chosen at random) and Bob the input of all other players, and to prove a lower 
bounds for this two-player problem. If the lower bound for the two player problem is L, the lower bound 
for the A; -player problem will be kL. For the blackboard model, the proofs have the same outhne, except in 
the 2-player problem Alice gets the input of one randomly chosen player. Bob gets the input of another, and 
they both get the inputs of all the rest of the players. There is one important thing to note here: 

• This argument only works when the hard distribution is symmetric. 

1.2 Motivation, Previous Work and Related Models 

Communication complexity is a widely studied topic. In multiplayer communication complexity, the most 
studied mode of communication is the blackboard model. The message-passing model was already consid- 
ered in |[T5l . (This model can also be called the private-message model, but note that this name was used 
in lfT8l[T6l for a different model.) The coordinator model can be thought of as a server-site setting a where 
there is one server and k sites. Each site has gathered n bits of information and the server wants to evalu- 
ate a function on the collection of these k ■ n bits. Each site can only communicate with the server, and a 
server can communicate with any site. This server-site model has been widely studied in the databases and 
distributed computing communities. Work includes computing top- A; ||7l|26l|29l and heavy hitters |[36l[T9l . 
Another closely related model is the distributed streaming model, in which we also have one server and 
k sites. The only difference is that now the computation is dynamic. That is, each site receives a stream of 
elements over time and the server would like to maintain continuously at all times some function / of all 
the elements in the k sites. Thus the server-site model can be seen as a one-shot version of the distributed 
streaming setting. It follows that any communication complexity lower bound in the message-passing model 
or the coordinator model also hold in the distributed streaming model. A lot of work on distributed streaming 



This terminology is similar as the standard "client-server", and is used extensively in the literature. 



has been done recently in the theory community and the database community, including maintaining random 
samplings Ha, frequency moments SHIl, heavy hitters ||4l|22l|24l[3l, quantiles EOlIMI, entropy S, 
various sketches |[T3l[T0l and some non-linear functions |[3Tl[32l . 

We will come back to the latter two models in Section [6l 

It is interesting to note that despite the large number of upper bounds (i.e. algorithms, communication 
protocols) in the above models, very few lower bounds have been proved in any of those models, likely 
because there were few known techniques to prove such results. 

A further application of the message-passing model could be for Secure Multiparty Computation: in this 
model, there are several players who do not trust each other, but want to compute a joint function of their 
inputs, with each of them learning nothing about the inputs of the others players except what can be learned 
from the value of the joint function. Obviously, any lower bound in the message passing model immediately 
implies a lower bound on the amount of communication required for Secure Multiparty Computation. For 
more on this model, see e.g. ifTTl . 

One final application is for the streaming model. In this model, there is a long stream of data that can only 
be scanned from left to right. The goal is to compute some function of the stream, and minimize the space 
usage. It is easy to see that if we partition the stream into k parts and give each part to a different player, 
then a lower bound of L on the communication complexity of the problem in the coordinator model implies 
a lower bound of L/k on the space usage. When t passes over the model are allowed, a lower bound of L in 
the coordinator model translates to a lower bound of L/tk in the streaming model. 

1.3 Our Results and Paper Outline 

Our main technical result in this paper are lower bounds of Q{nk) randomized communication for the 
bitwise A; -party AND, OR, and MAJ (majority) functions in the coordinator model. This can be found in 
Section [3] In the same section we prove some lower bounds for AND and OR in the blackboard model as 
well. Back to the coordinator model, we show that the connectivity problem (given k players with subgraphs 
on a common set of nodes, determine if it is connected) requires Q.{nk/ log^ k) communication. This is in 
Section m 

The coordinate- wise lower bounds imply lower bounds for the well-studied problems of heavy-hitters and 
e-kernels in the server-site model (or the other related models). We show any randomized algorithm requires 
at least J7(/c/e('^~^)/^) and Vl{n/e) communication, respectively. The former is shown to be tight. This is in 
Section [6] 

We give some direct-sum-lrke results in Section [5] 

2 Preliminaries 

In this section we review some basic concepts and definitions. 

We denote [n] ={!,..., n}. All logarithms are base-2 unless noted otherwise. 

Communication complexity. Consider two players Alice and Bob, given bit vectors A and B, respec- 
tively. Communication complexity (see for example the book |[23l ) bounds the communication between 
Alice and Bob that is needed to compute some function f{A, B). The communication complexity of a pai^- 
ticular protocol is the maximal number of bits that it communicated, taken in the worst case over all pairs of 
inputs. The communication complexity of the problem / is the best communication complexity of P, taken 
over all protocols P that coiTcctly compute /. 

Certain functions (such as / = EQ which determines if A equals B) can be computed with less commu- 
nication if randomness is permitted. Let R^{f) denotes the communication complexity when the protocol is 



allowed to make a mistake with probability e. The error is taken over the randomness used by the protocol. 

Sometimes we are interested in the case where the input of Alice and Bob is drawn from some distribution 
fi over pairs of inputs. We want to allow an error s, this time taken over the choice of the input. The 
worst-case communication complexity in this case is denoted by D^ {f). Yao |f33l showed that R^{f) > 
max^D^(/), thus in order to prove a lower bounds for randomized protocols it suffices to find a hard 
distribution and prove a distributional lower bound for it. This is called the Yao Minimax Principle. 

In this paper we use an uncommon notion of expected distributional communication complexity. In this 
case we consider the distributional setting as in the last paragraph, but this time consider the expected cost 
of the protocol, rather than the worst-case cost; again, the expectation is taken over the choice of input. In 
an abuse of notation, we denote this E[L)^(/)]. 

2.1 Two- Party Lower Bounds 

We state a couple of simple two-party lower bounds that will be useful in our reductions. 



2-BITS. Let C,p be a distribution over bit-vectors of length n, where each bit is 1 with probability p and 
with probability 1 — p. In this problem Alice gets a vector drawn from C,p, Bob gets a subset S of [n] of 
cai^dinality \S\ = m, and Bob wishes to learn the bits of Alice indexed by S. 
The proof of the following lemma is in Appendix lAl 

Lemma 2.1. E[i?J/^ (2-6/75)] = n{nplog{l/p)). 

2-DISJ. In this problem Alice and Bob each have an n-bit vector. If we view vectors as sets, then each of 
them has a subset of [n] corresponding to the 1 bits. Let x be the set of Alice and y be the set of Bob. It is 
promised that jx n y| = 1 or 0. The goal is to return 1 if x n y / 0, and otherwise. 

We define the input distribution p as follows. Let / = (n + l)/4. With probability 1/t, x and y are 
random subsets of [n] such that \x\ = \y\ = I and jx n y| = 1. And with probabihty 1 — 1/t, x and y are 
random subsets of [n] such that |x| = \y\ = I and x n y = 0. Razborov ll30l (see also ll20l ) proved that for 
t = 4, DfJ (2-DISJ) = n{n). In the following theorem we extend this result to general t and also to 
the expected communication complexity. In Section [3TT] we only need i = 4, and in Section |4]we will need 
general t. 

The proof for the following lemma is in Appendix iBl 

Lemma 2.2. When p has \xny\ = 1 with probability 1/t then E D/^/ {2-DISJ) = $7(n). 



3 Bit-wise Problems 

3.1 Multiparty AND/OR 

We now consider multiparty AND/OR (below we use /s-AND and k-OR for short). In the fc-AND problem, 
each player i {I < i < k) has an 77,-bit vector Ij and we want to establish the bitwise AND of /j, that 
is, fj{Ii, . . . ,Ik) = Ai ^i,j for J = {!)•••) '^I- ^-OR is similarly defined with OR. Observe that the two 
problems are isomorphic by fj{Ii, . . . , Ik) = "'djih, ■ ■ ■ , Ik) for J = {Ij • • • > "^}> where Ij is obtained by 
flipping all bits of I^. Therefore we only need to consider one of them. Here we discuss k-OR. 



3.1.1 Idea for the A;-OR Lower Bound in the Coordinator Model 

We now discuss the hard distribution and sketch how to apply the symmetrization technique for the k-OR 
problem in the coordinator model. The formal proof can be found in the next subsection. 

We in fact start by describing two candidate hard distributions that do not work. The reasons they do not 
work are interesting in themselves. Throughout this subsection, assume for simplicity that k > 100 log n. 

The most natural candidate for a hard distribution is to make each entry equal to 1 with probability 1/k. 
This has the effect of having each bit in the output vector be roughly balanced, which seems suitable for 
being a hard case. This is indeed the hard distribution for the blackboard model, but for the coordinator 
model (or the message-passing model) it is an easy distribution: Each player can send his entire input 
to the coordinator, and the coordinator can figure out the answer. The entropy of each player's input is 
only Q{n\ogk/k), so the total communication would be ©(nlogA;) in expectation using e.g. Shannon's 
coding theorem!^ this is much smaller than the lower bound we wish to prove. Clearly, we must choose 
a distribution where each player's input has entropy f](n). This is the first indication that the /c-player 
problem is significantly different than the 2-player problem, where the above distribution is indeed the hard 
distribution. 

The next candidate hard distribution is to randomly partition the n coordinates into two equal-sized sets: 
The important set, where each entry is equal to 1 with probability 1/k, and the balancing set, where all 
entries are equal to 1. Now the entropy of each player's input is Q{n), and the distribution seems like a good 
candidate, but there is a surprising upper bound for this distribution: the coordinator asks 100 log n players 
to send him their entire input, and from this can easily figure out which coordinates are in the balancing set 
and which are in the important set. Henceforth, the coordinator knows this information, and only needs to 
learn the players' values in the important set, which again have low entropy. We would want the players to 
send these values, but the players themselves do not know which coordinates are in the important set, and 
the coordinator would need to send nk bits to tell all of them this information. However, they do not need to 
know this in order to get all the information across: using a protocol known as Slepian-Wolf coding (see e.g. 
|[T4l ) the players can transmit to the coordinator all of their values in the important coordinates, with only 
n log k total communication (and a small probability of error). The idea is roughly as follows: each player 
Pi chooses lOOn log k/k sets Sij C [n] independently and uniformly at random from public randomness. 
For each j, the player XORs the bits of his input in the coordinates of Sij, and sends the value of this XOR 
to the coordinator. The coordinator already knows the balancing set, so he only has Q{nlog k/k) bits of 
uncertainty about player i's input, meaning he can reconstruct the player's input with high probability (say 
by exhaustive search). The upper bound follows. 

To get an actual hard distribution, we fix the hard distribution from the last paragraph. We randomly 
partition the n coordinates into two equal-sized sets: The important set, where each entry is equal to 1 with 
probability 1/n, and the noise set, where each entry is equal to 1 with probability l/2o Clearly, each player's 
input has entropy G(n). Furthermore, the coordinator can again cheaply figure out which coordinates are in 
the important set and which are in the noise set, but the players do not know this information, and nothing 
like Slepian-Wolf coding exists to help them transmit the information to the coordinator. The distribution 
that we use in our formal proof is a little different than this, for technical reasons, but this distribution is hard 
as well. 

We now sketch how to apply the symmetrization technique to prove that this distribution is indeed hard. 
To apply the symmetrization technique, we imagine giving Alice the input of one of the players, and to Bob 



''To show the upper bounds in this subsection we use some notions from information theory without giving complete background 
for them. The reader can refer to e.g. 1141 . or alternatively can skip them entirely, as they are inconsequential for the remained of 
the paper and for understanding the symmetrization technique. 

''We could have chosen each entry in the important set to be equal to 1 with probability 1/k as well, but choosing a value of 
1/n makes the proofs easier. The important thing is to choose each of the noise bits to be equal to 1 with probability 1/2. 



the input of all of the others. Bob plays the coordinator, so Bob needs to compute the output; the goal is 
to prove a lower bound of Q{n) on the communication complexity between Alice and Bob. What can Bob 
deduce about the answer? He can immediately take the OR of all the vectors that he receives, getting a vector 
where with good probability all of the noise bits are equal to 1. (Recall that we assumed that the number of 
players is A; > 100 log n.) Among the important bits, roughly one of Alice's bits is equal to 1, and the goal 
is to discover which bit it is. Alice cannot know which coordinates ai^e important, and in essence we are 
trying to solve a problem very similar to Set Disjointness. A lower bound of r2(n) is easy to get convinced 
of, since it is similar to the known lower bounds for set disjointness. Proving it is not entirely trivial, and 
requires making some small modifications to Razborov's classical lower bound on set disjointness OOl . 

We see that the lower bound for this distribution has to (implicitly) rule out a Slepian-Wolf type upper 
bound. This provides some evidence that any lower bound for the A;-OR problem would have to be at least 
somewhat non-trivial. See further discussion in Section |7?T1 

3.1.2 The Proof 

We prove the lower bound on A;-OR by performing a reduction from the promise version of the two-party set 
disjointness problem (2-DISJ) which we lower-bounded in Lemma l2!2l Given an (x,y) for 2-DISJ drawn 
from the hard distribution /i, we construct an input for k-OR. Note that our mapping is not necessarily 
one-to-one, that is why we need a lower bound on the expected distributional communication complexity of 
2-DISJ. 

Reduction. We start with Alice's input set x and Bob's input set y from the distribution /i, with t = 4. 
That is, both x and y have / = n/A 1 bits chosen at random under the condition that they intersect at one 
point with probability 1/4, otherwise they intersect at no point. For the 2-DISJ problem, we construct k 
players' input sets /i, . . . , /jt as follows. Let z = [n] — y. Let S\, . . . , S\^he. random subsets of size / from 
z, and ^2"^ , . . . , S^^ be random subsets of size / — 1 from z. Let S\, . . . ^S\he random elements from y. 

Ii = X 

^^■(^ = ''---'^) = l5J-iu4 w.p.1/4 

Let i_l' be this input distribution for A:-OR. If /_, (2 < j < k) contains an element 5j, then we call this 
element a special element. 

This reduction can be interpreted as follows: Alice simulates a random player Ii, and Bob, playing as 
the coordinator, simulates all the other k — 1 players I2, ■ ■ ■ ,Ik- Bob also keeps a set V containing all the 
special elements that ever appear in Ij (j = 2, . . . , k). It is easy to observe the following fact. 

Lemma 3.1. All Ij (j = 1, . . . ,k) are chosen from the same distribution. 

Proof. Since by definition Ij {j = 2, . . . ,k) are. chosen from the same distribution, we only need to show 
that Ii = X under fi is chosen from the same distribution as any Ij is under f.i'. Given y, note that a:; is a 
random set of size lin z = [n\—y with probability 1/4; and with the remaining probability x is the union of 
a random set of size / — 1 in z along with a single random element from y. This is precisely the distribution 
of each Ij for j > 2. D 

The following lemma shows the properties of our reduction. 

Lemma 3.2. If there exists a protocol V' for k-OR on input distribution fi' with communication complexity 
C and error bound e, then there exists a protocol V for the 2-DISJ on input distribution jjl with expected 
communication complexity 0{C/k) and error bound e + Ak/n. 



Proof. Let us again view Ij (j = 1, . . . , A;) as n-bit vectors. We show how to construct a protocol V for 
2-DISJ from a protocol V' for k-OR with the desired communication cost and error bound. V is constructed 
as follows: Alice and Bob first run "P' on Ii, . . . , I^. Let W C [n] be the set of indices where the results 
are 1. Bob checks whether there exists some w ^ W r\y such that w ^ V. If yes, then V returns "yes", 
otherwise V returns "no". 

We start by analyzing the communication cost of V. Since player /i is chosen randomly from the k 
players, and Lemma [3TT] that all players' inputs are chosen from a same distribution, the expected amount of 
communication between /i (simulated by Alice) and the other k — 1 players (simulated by Bob) is at most 
a 2/k fraction of the total communication cost of V. Therefore the expected communication cost of V is at 
most 0{C/k). 

For the error bound, we have the following claim: With probability at least (1 — Ak/n), there exists a 
w &W f^y such that w ^V if and only if rr n y 0. First, if x n y = 0, then Ii = x cannot contain any 
element w ^ V Q y, thus the resulting bits mW r\y cannot contain any special element that is not in V. 
On the other hand, we have Pr[((Ty n y) C F) A (x n y / 0)] < Ak/n. This is because {{W Pi y) C y) 
and (x n y 7^ 0) hold simultaneously if and only if there exist some S^ {1 < j < k) such that 5"^ G x n y. 
According to our random choices of 5^ {j = 1, . . . ,k), this holds with probability at most k/l < Ak/n. 
Therefore if V' errors at most e, then V errors at most e + Ak/n. D 

Combining Lemma |2!2l and Lemma 1X2] we have the following theorem. 

Theorem 3.1. D{ {k-OR) = VL{nk),for n > 3200A; in the coordinator model. 

Proof. If there exists a protocol V' that computes A;-OR on input distribution /u' with communication com- 
plexity o{nk) and error bound 1/800, then by Lemma [3^ there exists a protocol V that computes 2-DISJ on 
input distribution ^ with expected communication complexity o{n) and eiTor bound l/800+4A;/?i < 1/400, 
contradicting Lemma |2!2] ( when t = A). D 

3.2 Multiparty AND/OR with a Blackboard 

Denote the A;-OR problem in the blackboard model by fe-OR-board. The general idea to prove a lower bound 
for /c-OR-board is to perform a reduction from a 2-party bit-wise OR problem (2-OR for short) with public 
randomness. The 2-OR problem is the following. Alice and Bob each have an n-bit vector drawn from the 
following distribution: Each bit is 1 with probability 1/k and with probability 1 — 1/k. They also use 
public random bits to generate another {k — 2) n-bit vectors such that each bit of these vectors is 1 with 
probability 1/k and with probability 1 — 1/k. That is, the bits are drawn from the same distribution as 
their private inputs. Let v be this input distribution. Alice and Bob want to compute bit- wise OR of all these 
k n-bit vectors. For this problem we have the following theorem. 

Theorem 3.2. E[L»y^(2-0/?)] = VL(n/k ■ \ogk). 

Proof. W.l.o.g., let us assume that Bob outputs the final result of 2-OR. It is easy to see that if we take 
the bit-wise OR of the {k — 2) n-bit vectors generated by public random bits and Bob's input vector, the 
resulting n-bit vector b will have at least a constant density of bits with probability at least 1 — o(l), by 
a Chernoff bound. Since Bob can see the k — 2 public vectors, to compute the final result, all that Bob has 
to know ai^e bits of Alice's vector on those indices i {1 < i < n) where h[i] = 0. In other words, with 
probability at least 1 — o(l). Bob has to learn a specific, at least constant fraction of bits in Alice's vector up 
to error 1/3. Plugging in Lemma |2?T] with p = 1/k, we know that the expected communication complexity 
is at least (1 - o(l)) • 9.{n/k ■ log k) = 9.{n/k ■ log A;). D 



We reduce this problem to A;-OR-board as follows: Alice and Bob each simulates a random player, and 
they use public random bits to simulate the remaining k — 2 players: The {k — 2) n-bit vectors generated by 
their public random bits are used as inputs for the remaining k — 2 random players. Observe that the input 
of all the k players are drawn from the same distribution, thus the k players are symmetric. Consequently, 
the expected amount of communication between the two random players simulated by Alice and Bob and 
other players (including the communication between the two random players) is at most 0{l/k) faction of 
the total communication cost of protocol for A:-OR-board. We have the following theorem. 

Theorem 3.3. dV^ {k-OR-board) = Vt{n ■ log A;). 

Proof. It is easy to see that if we have a protocol for the fc-OR-board on input distribution u with com- 
munication complexity o{n • log A;) and error bound 1/3, then we have a protocol for 2-OR on input dis- 
tribution V with expected communication complexity o{n/k • log A;) and error bound 1/3, contradicting 
Theorem 13.21 D 



It is easy to show a tight deterministic upper bound of 0(n log fc) for this problem in the blackboard 
model: each player speaks in turn and writes the coordinates where he has 1 and no player that spoke before 
him had 1. 

3.3 Majority 

In the fe-MAJ problem, we have k players, each having a bit vector of length n, and they want to compute 
bit-wise majority, i.e. to determine for each coordinate whether the majority of entries in this coordinate 
are 1 or 0. We prove a lower bound of Q{nk) for this problem in the coordinator model by a reduction to 
2- BITS via symmetrization. 

For the reduction, we consider k = 2t + 1 players and describe the input distribution r as follows. For 
each coordinate we assign it either t or (t+ 1), each with probability 1/2, independently over all coordinates. 
This indicates whether the k players contain t or (t + 1) 1 bits among them in this coordinate, and hence 
whether that index has a majority of or 1, respectively. Then we place the either t or (t + 1) 1 bits randomly 
among the k players inputs in this coordinate. It follows that under r: (i) each player's bits are drawn from 
the same distribution, (ii) each bit of each player is 1 with probability 1/2 and with probability 1/2, and 
(iii) each index has probability 1/2 of having a majority of Is. 

Theorem 3.4. Dr (k-MAJ) = n{nk) in the coordinator model. 

Proof. Now we use symmetry to reduce to the two-player problem 2- BITS. Alice will simulate a random 
player under r and Bob simulate the other A; — 1 players. Notice that, by (ii) any subset of n' indices of 
Alice's ai^e from Ci/2- We will show that Alice and Bob need to solve 2-BITS on Q{n) bits to solve A:-MAJ. 
And, by (i), since all players have the same distribution and Alice is a random player, her expected cost in 
communicating to Bob is at most 0{C/k) if A;-MAJ can be solved in C communication. 

Now consider the aggregate number of 1 bits Bob has for each index; he has (t — 1) 1 bits with probability 
1/4, t 1 bits with probability 1/2, and (t + 1) 1 bits with probability 1/4. Thus for at most (3/4)n indices 
(with probability at least 1 — exp(— 2(n/4)^/n) = 1 — exp(— n/8) > 1 — 1/7) that have either (t — 1) or 
(t + 1) 1 bits Bob knows that these either will or will not have a majority of 1 bits, respectively. But for the 
other at least n/4 = J7(n) remaining indices for which Bob has exactly 1 1 bits, whether or not these indices 
have a majority of 1 bits depends on Alice's bit. And conditioned on this situation, each of Alice's relevant 
bits are 1 with probability 1/2 and with probability 1/2, hence distributed by (^x/2- Thus conditioned on 
at least n/4 undecided indices, this is precisely the 2-BITS problem between Alice and Bob of size n/4. 

Thus a protocol for k-MAJ in o(nA;) communication and error bound 1/6 would yield a protocol for 2- 
BITS in expected o{n) communication and eiTor bound 1/6+1/7 < 1/3, by running the protocol simulated 



between Alice and Bob. This contradicts Lemma l2?Tl and proves that fc-M AJ requires O(fcn) communication 
when allowing eiTor on at most 1/6 fraction of inputs. D 

Extensions. This lower bound can easily be extended beyond just majority (threshold 1/2) to any constant 
threshold cl){0 < cp < I), by assigning to each coordinate either [kcl)\ or {[k(j)\ + 1) 1 bits with probability 
1/2 each. Let r^ denote this distribution. Then the analysis just uses (^ in place of (^^ /2' which also yields 
an 9,{n) lower bound for 2-BITS. We call this extended A;-MAJ problem (k, 0)-MAJ. 

1 /fi 

Corollary 3.1. Dr, {{k, (f))-MAJ) = fl{nk) for any constant 0(0 < < 1) in the coordinator model. 
We discuss the applications of this lower bound to the heavy -hitter problem in Section [6l 

4 Graph Connectivity 

In the fc-CONN problem, we have k players, each having a set of edges in an n-vertex graph. The goal is to 
decide whether the graph consisting of the union of all of these edges is connected. In this section we prove 
an Q.{nk/\og^k) lower bound for A;-CONN in the coordinator model, by performing a symmetry-based 
reduction from 2-DISJ. 

4.1 Proof Idea and the Hard Distribution 

Let us start by discussing the hard distribution. Assume for simplicity k > 100 log n. We describe a hard 
distribution which is not quite the same as the one in the proof (due to technical reasons), but is conceptually 
clearer. In this hard distribution, we consider a graph G, which consists of two disjoint cliques of size n/2. 
We also consider one edge between these two cliques, called the connector, the connector is not part of 
G. Each player gets as input n/10 edges randomly and uniformly chosen from the graph G; furthermore, 
with probability 1/2 we choose exactly one random edge in one random player's input, and replace it by 
the connector. It is easy to see that if one of the players got the connector, then with high probability the 
resulting set of edges span a connected graph, and otherwise the graph is not connected. 

To get convinced that the lower bound holds, notice that the coordinator can easily reconstruct the graph 
G. However, in order to find out if one of the players has received the connector, the coordinator needs to 
speak with each of the players to find this out. The situation is roughly analogous to the situation in the k-OR 
problem, since the players themselves did not get enough information to know G, and no Slepian-Wolf type 
protocol is possible since the edges received by each player are random-looking. The actual distribution 
that we use is somewhat more structured than this, in order to allow an easier reduction to the 2-player 
disjointness problem. 

4.2 The Proof 

We first recall 2-DISJ. Similarly to before, in 2-DISJ, Alice and Bob have inputs x {\x\ = £) and y {\y\ = £) 
chosen uniformly at random from [n] (n = 4^—1), with the promise that with probability 1/lOA;, Ix n y | = 1 
and with probability 1 — 1/lOk, |x n y| =0. Let ip be this input distribution for 2-DISJ. Now given an input 
(x, y) for 2-DISJ, we construct an input for A;-CONN. 

Let K2n = {V, E) be the complete graph with 2n vertices. Given Alice's input x and Bob's input y, we 
construct k players' input Ii, . . . ,Ik such that \Ij\ = £ and Ij C i? for all 1 < j < A:. We first pick a random 
permutation a of [2n\. Alice constructs /i = {{a{2i — l),a{2i)) \ i G x}. 

Bob constructs /2, ■ ■ ■ , -^fc- It is convenient to use a and y to divide V into two subsets L and R. For each 
« (1 < « < n), if z G y, then with probability 1/2, we add a{2i — 1) to L and a{2i) to R; and with the 
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rest of the probability, we add a{2i — 1) to R and a{2i) to L. Otherwise if i y, then with probabihty 
1/2, we add both a{2i — 1) and a{2i) to L; and with the rest of the probability, we add both a{2i — 1) 
and a{2i) to R. Let Kj^ = {L, El) and Kji = {R, Er) be the two complete graphs on sets of vertices L 
and R, respectively. Now using Er and El, Bob can construct each Ij. With probability 1 — l/lO/c Ij is a 
random subset of disjoint edges (i.e. a matching) from El U Er of size t, and with probability l/lO/c, Ij is 
a random subset of disjoint edges from El U Er of size ^ — 1 and one random edge from E \ {El U Eli). 

Let if' be the input distribution for /c-CONN defined as above for each Ij (1 < j < k). We define the 
following two events. 

^i: Both edge-induced subgraphs U^^2-^j Pi ^L and U^^2^j C\ ^R ^^ connected, and span L and R re- 
spectively. 

^2^ U^^2-(j' n -^ is ""^ '^onn^'^ted. 

It is easy to observe the following two facts by our construction. 

Lemma 4.1. All Ij {j = 1, . . . ,k) are chosen from the same distribution. 

Proof. Since o" is a random permutation of [2n], according to the distribution Lp' , Bob's input can also 
seen as follows: With probability 1 — l/lO/c, it is a random matching of size i. from El U Er; and with 
probability 1/lOA;, it is a matching consists of £ — 1 random edges from El U Eli and one random edge 
from E\{El U Er), which is {a{2z - 1), o-(2z)) where z = xr\y. D 

Lemma 4.2. ^i happens with probability at least 1 — l/2n when k > 68 In n + 1. 

Proof. (This is a proof sketch; full proof in Appendix ICl) First note that by our construction, both \L\ and 
\R\ are Q{n) with high probability. To locally simplify notation, we consider a graph {V, E) of n nodes 
where edges are drawn in (A: — 1) > 68 Inn rounds, and each round n/4 disjoint edges are added to the 
graph. If (y, E) is connected with probability (1 — l/4n), then by union bound over U^^2^j fl ^L and 
U^'^2^i n -^fi' Ci is true with probability (1 — l/2?i). The proof follows four steps. 

(51) We show that all points have degree at least 8 Inn with probability at least 1 — l/12n; this uses the 
first 28 In n rounds. 

(52) We show (conditioned on (SI)) that any subset S C V oi h < n/10 points is connected to at least 
min{/i In ?i, n/10} distinct points in y \ S, with probability at least 1 — 1/n^. 

(53) We can iterate (S2) In n times to show that there must be a single connected component Sq of size at 
least n/10, with probability at least 1 — l/12n. 

(54) We can show (conditioned on (S3) and using the last 40 In n rounds) that all points are connected to 
Sg with probability at least 1 — l/12n. D 

The following lemma shows the properties of our reduction. 

Lemma 4.3. Assume k > 100 log n. If there exists a protocol V' for k-CONN on input distribution ip' 
with communication complexity C and error bound e, then there exists a protocol V for the 2- D IS J on input 
distribution ip with expected communication complexity 0{C /k ■ log k) and error bound 2e + 1/2000/c. 

Proof. In V, Alice and Bob first construct {/i , . . . , J^} according to our reduction, and then run the protocol 
V' on it. By Lemma l4!2l we have that ^i holds with probability at least 1 — l/2n. And by our construction, 
conditioned on that ^i holds, ^2 holds with probability at least (1 — l/lOA:)'^"^ > 1 — 1/10. Thus the input 
generated by a random reduction encodes the 2-DISJ problem with probability at least (1 — l/2n — 1/10) > 
8/9. We call such an input a good input. We repeat the random reduction clogk times (for some large 
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enough constant c) and run V' on each of the resuhing inputs for fe-CONN. The probabiUty that we have at 
least one good input is at least 1 — 1/2000A; (by picking c large enough). If this happens, Bob randomly pick 
an input among those good ones and the result given by running protocol V' on that input gives the correct 
answer for 2-DISJ with probability at least 1 — 2e, by a Markov inequality, since by our reduction at least a 
8/9 fraction of inputs under ip' are good and V' has error bound e under Lp' . Therefore we obtained a protocol 
V for 2-DISJ with expected communication complexity 0{C jk ■ log k) and error bound 2e + 1/2000/;:. D 

Combining Lemma |2!2l and Lemma [431 we have the following theorem. 

Theorem 4.1. Assume k > 100 log n. Then D^J,^^^^''{k-CONN) = n{nk/ log k) 

Proof. If there exists a protocol V' that computes A;-CONN with communication complexity o(nfc/ log k) 
and eiTor 1/4000A;, then by Lemma [l!2] there exists a protocol V that computes 2-DISJ with expected 
communication complexity o{n) and error at most (2 • 1/4000/c + l/2000/i:) = 1/lOOOfc, conti"adicting 
Lemma f22\ (when t = Wk). D 

Finally, we have i?i/3(^.(30NN) > 0(iiV^o°o'=(fc-CONN)/log/fc) > n{D^J,^°^°''{k-CONN)/logk) > 
Q.{nk/ log^ k) as an immediate consequence. 

5 Some Direct-Sum-Like Results 

Let f : X X y ^ Z he. nn arbitrary function. Let ^u be a probability distribution over X x y. Con- 
sider a setting where We have k + 1 players: Carol, and Pi,P2, . . . ,Pk- Carol receives an input from 
X £ X and each Pi receives an input y, G y. Let R^{f^) denote the randomized communication com- 
plexity of computing / on Carol's input and each of the k other players' inputs respectively; i.e., computing 
/(x, yi), /(x, 2/2), . . . , /(x, yfc). Our direct-sum-like theorem in the message-passing model states the fol- 
lowing. 

Theorem 5.1. In the message-passing model, for any function f : X x y ^ Z and any distribution ^ on 
X xy,we have R'{f^) > n{k ■ E[L»^(/)]). 

Note that this is not a direct-sum theorem in the strictest sense, since it relates randomized communi- 
cation complexity to expected distributional complexity. However, it should probably be good enough for 
most applications. The proof of this theorem is dramatically simpler than most direct-sum proofs known 
in the literature (e.g. 161). This is not entirely surprising, as it is weaker than those theorems: it deals with 
the case where the inputs are spread out over many players, while in the classical direct-sum setting, the 
inputs are only spread out over two players (this would be analogous to allowing the players Pi to com- 
municate with each other for free, and only charging them for speaking to Carol). However, perhaps more 
surprisingly, optimal direct-sum results are not known for most models, and are considered to be central 
open questions, while the result above is essentially optimal. Optimal direct-sum results in 2-player models 
would have dramatic consequences in complexity theory (see e.g. f2\\ as well as |[28l ). so it seems inter- 
esting to check whether direct-sum results in multiparty communication could suffice for achieving those 
complexity-theoretic implications. 

Proof. Given the distribution pon X xy,we construct a distribution u on X xy''. Let px be the marginal 
distribution on 3^ induced by /i condition on X = x. We first pick {x,yi) (^ X xy according to /i, and then 
pick y2, ■ ■ ■ , ijk independently from 3^ according to px . We show that Dl{f^) > ^{k • E[D^(/)]). The 
theorem follows by Yao's min-max principle. 

Suppose that Alice and Bob get inputs {u, w) from X x y according to /x. We can use a protocol for f'^ 
to compute f{u, w) as follows: Bob simulates a random player in {Pi, . . . , P^}. W.l.o.g, say it is Pi. Alice 
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simulates Carol and the remaining k — I players. The inputs for Carol and Pi, . . . ,Pk are constructed as 
follows: X = u, yi = w and 2/2, . . . , yfc are picked from 3^ according to pu (Alice knows u and fi so she can 
compute pu). Let u be the distribution of (x, yi, . . . , t/k) in this construction. We now run the protocol for 
f^ onx,yi, . . . , yk- The result also gives /(n, w). 

Since yi, . . . , y^ ai^e chosen from the same distribution and Pi is picked uniformly at random from the 
k players other than Cai^ol, we have that in expectation, the expected amount of communication between 
Pi and {Carol, P2, ■ ■ ■ ,Pk}, or equivalently, the communication between Alice and Bob according to 
our construction, is at most a 2/k fraction of the total communication of the {k + l)-player game. Thus 

Di{f)>n{k.nD',{f)]). □ 

5.1 With Combining Functions 

In this section we extend Theorem 15. II to the complexity of the AND/OR of k copies of 0/1 function /. 
Similar to before, AND and OR ai^e essentially the same so we only talk about OR here. Let P^(/qpj) 
denote the randomized communication complexity of computing f{x,yi) V /(x, 2/2) V ... V /(x, y^). 

Theorem 5.2. In the message-passing model, for every function f : X xy ^ {0, 1} and every distribution 
ponX xy such that p{f-^{l)) < 1/Wk, wehave R^/^{f^^) > f7(A;/log(l/e) • E[P)^(/)]). 

Proof. The proof is similar as that for Theorem 15. II The reduction is the same as that in the proof of Theo- 
remO l.&tDl''^{f^^) = C. Note that if /^(/-^(l)) < 1/lOA;, then with probability (l-l/10/c)*=~i > 0.9, 
we have fQ^{x, yi, . . . , y^) = f{u, w). We can boost this probability to (1 — e/2) by repeating the reduction 
for 0(log 1/e) times and then focus on a random good input (if any), as we did in the proof of Lemma |431 
The A: -player protocol on a random good input succeeds with probability at least 1 — 2 • e/4, by a Mai^kov 
inequality. Therefore we have a protocol for / with expected communication complexity 0(log 1/e • C/k) 
and error bound (2 • e/4 + e/2) = e. D 

It would be interesting to generalize this to other combining functions such as majority, XOR and others. 

One can also use symmetrization to prove similar direct-sum results for other settings. One such setting 
is when there are 2k players: k players that get xi, . . . ,Xk and k players that get yi, . . . ,yk, and the goal is 
to compute /(xi, yi), /(x2, y2), • • • , f{xk, yk)- Another setting is the same except that there are only k-\-l 
players, and Carol receives all of the inputs xi, . . . , x^. We omit the details here. Versions of direct-sum in 
the blackboard model are also possible for some of these settings. 

6 Applications 

We now consider applications where multiparty communication complexity lower bounds such as ours are 
needed. As mentioned in the introduction, our multiparty communication problems are strongly motivated 
by research on the server-site model and the more general distributed streaming model. We discuss two 
problems here, the heavy hitters problem which asks to find the approximately most frequently occurring 
elements in a set which is distributed among many clients, and the e-kemel problem which asks to approxi- 
mate the convex hull of a set which is distributed over many clients. 

e-Kernels. Given a set of n points P C R'^, the width in direction u is denoted by 

•\d(P,u) = ( niax(p, u) ) — ( min(p, u) I , 
\p&p J \peP ) 
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where (•, •) is the standard inner product operation. Then an e-kemel HKH i^ is a subset of P so that for 
any direction u 

wid (P, u) —\N\d{K,u) < e • wid (P, n) . 

An e-kernel K approximates the convex hull of a point set P, such that if the convex hull of K is expanded 
in any direction by an e-factor it contains P. As such, this coreset has proven useful in many applications 
in computational geometry such as approximating the diameter and smallest enclosing annulus of point 
sets Em. It has been shown that e-kernels may require il(l/e''^~^"^) points (on a {d — l)-sphere in R'^) 
and can always be constructed of size 0(l/e*^'^~^)/^) in time 0{n + l/e'^"'^/^) OSllSl. 

We can note a couple of other properties about e-kernels. Composibility : If Ki , K2 , ■ ■ ■ , Kk are e-kemels 
of Pi, P2, . . . , Pfc, respectively, then K = IJi=i ^i i^ ^^ e-kernel of P = Uj=i ^i- Transitivity: If Ki is an 
ei-kernel of P and K2 is an e2-kernel of Ki, then K2 is an (ei + e2)-kernel of P. Thus it is easy to see that 
each site i can simply send an (e/2)-kemel K^ of its data of size Us = 0{l/e^^~^^^'^) to the server, and the 
server can then create and (e/2)-kernel of Ui=i ^i of si^^ 0{l/e^'^~^'''^). This is asymptotically optimal 
size for and e-kernel of the full distributed data set. We next show that this procedure is also asymptotically 
optimal in regards to communication. 

Theorem 6.1. For a distributed set of k sites, it requires ^{k/e^ '''^) communication total between the 
sites and the server for the server to create an e-kemel of the distributed data with probability at least 2/3. 

Proof. We describe a construction which reduces k-OR to this problem, where each of k players has n^ = 
0(l/e^'^~^^'^) bits of information. Theorem [STT] shows that this requires n{n^k) communication. 

We let each player have very similar data Pj = {pi,i, . . . ,Pi,ne}' each player's data points lie in a unit ball 
P = {g G R"^ I \\q\\ < 1}. For each player, their n^ points are in similar- position. Each player's jth point 
Pi J is along the same direction Uj, and its magnitude is either \\pij\\ = 1 or \\pij\\ = 1 — 2e. Furthermore, 
the set of directions U = {ui} aie well-distributed such that for any player i, and any point pij that Pj \ pij 
is not an e-kernel of Pi; that is, the only e-kernel is the full set. The existences of such a set follows from 
the known lower bound construction for size of an e-kernel. 

We now claim that the k-OR problem where each player has n^ bits can be solved by solving the dis- 
tributed e-kernel problem under this construction. Consider any instance of k-OR, and translate to the 
e-kemel problem as follows. Let the jth point pij of the ith player have norm \\pij\\ = 1 when jth bit of the 
player is 1, and have norm \\pij\\ = 1 — 2e if the jth bit is 0. By construction, an e-kernel of the full set must 
acknowledge (and contain) the jth point from some player that has such a point with norm 1, if one exists. 
Thus the full e-kernel encodes the solution to the k-OR problem: it must have n^ points and, independently, 
the jth point has norm 1 if the jth OR bit is 1, and has norm 1 — 2e if the jth OR bit is 0. D 

Heavy Hitters. Given a multi-set 5 that consists of ?i elements, a threshold parameter (/>, and an error 
parameter e, the approximate heavy hitters problem asks for a set of elements which contains all elements 
that occur at least (j) times in S and contains no elements that occur fewer than cl){l—e) times in S. On a static 
non-distributed data set this can easily be done with sorting. This problem has been famously studied in the 
streaming literature where the Misra-Gries |[27l and SpaceSaving |[25l summaries can solve the problem in 
0(l/e) space. In the distributed setting the best known algorithms use random sampling of the indices and 
require either 0((l/e^)nlogn) or 0{k + Vkn/e ■ logn) communication to guarantee a correct set with 
constant probability |[T9l . We will prove a lower bound of Q,{n/e) and leave tightening this bound as an 
open problem. 

We specify a specific formulation of the approximate heavy hitters problem as {k, (p, e)-HH as follows. 
Consider k players, each with a bit sequence (either or 1) of length n where each coordinate represents an 
element. The goal is to answer YES for each index with at least (j)k elements, NO for each index with no 
more than (/)A:(1 — e) elements, and either YES or NO for any count in between. The reduction is based on 
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a distribution r^ ^ where independently each index has either (pk or cl)k{l — e) 1 bits, each with probability 
1/2. In the reduction the players are grouped into sets of ke players each, and all grouped players for each 
index ai^e either given a 1 bit or all players are given a bit. These 1 bits are distributed randomly among 
the 1/e groups. The proof then uses Corollary 13. II 

Theorem 6.2. Dll^l{{k,(j),e)-HH) = n{n/e). 

Proof. To lowerbound the communication for {k, (p, e)-HH we first show another problem is hard: (1/e, (/))- 
HH (assume 1/e is an integer). Here there are only 1/e players, and each player at each index has a count of 
or ke, and we again want to distinguish between total counts of at least kcp (YES) and at most k(j){l — e) 
(NO). By distribution r^ ^ each index has a total of either kcp or A;(/)(l — e) exactly. And then we distribute 
these bits to players so each player has precisely either or ke at each index. When k is odd, this is precisely 
the (1/e, (/>)-MAJ problem, which by Corollary 13. II takes r2(n/e) communication. 

Now it is easy to see that Dr^^{{l/e, (^)-HH) < D^^ ^{{k, (j), e)-HH), since the former on the same input 
allows (1/e) sets of ke players to talk to each other at no cost. D 

7 Conclusions and Open Problems 

In this paper we have introduced the symmetrization technique, and have shown how to use it to prove lower 
bounds for A;-player communication games. This technique seems widely applicable, and we expect future 
work to find further uses. 

The symmetrization technique has several limitations, which we wish to discuss here. 

Firstly, there are problems that might be impossible to lower bound using symmetrization. Consider for 
example the fc-player disjointness problem, where each player gets a subset of {1, . . . ,n}, and the goal 
is to decide whether there is an element that appears in all of the sets. This problem is easier than the 
coordinate-wise AND problem. We believe that this problem has a communication lower bound of Q{nk) 
in the coordinator model. However, it seems impossible to prove this lower bound using symmetrization, 
for the following reason. Suppose we give Alice the input of a randomly-chosen player pi, and give Bob the 
inputs of all the other players. It seems that for any symmetric input distribution, this problem can be solved 
using 0(?i log k/k) bits in expectation, which is much lower than the il(n) lower bound we are aiming for. 
It is not out of the question that some variant of the symmetrization technique can be used to get the Q,{nk) 
lower bound, but we do not know how to do it, and it might well be impossible. We leave this problem as 
an intriguing open question. 

The second limitation is that symmetrization seems to require proving distributional lower bounds for 
2-player problems, often over somewhat-convoluted distributions. This presents some difficulty for the 
researcher, who needs to start proving lower bounds from scratch and cannot use the literature, since lower 
bounds in the literature are proved for other distributions. Yao's minimax principle cannot be used, since it 
only guarantees that there is some hard distribution, but it doesn't guarantee anything for the distribution of 
interest. This is often only a methodical difficulty, since it is often easy to get convinced that the distribution 
of interest is indeed hard, but a proof of this must still be found, which is a tedious task. It would be useful 
if these was some way to circumvent this difficulty, for example by finding a way that standard randomized 
lower bounds can be used. 

The third limitation is that in order to use symmetrization, one needs to find a hard distribution for the 
A;-player problem which is symmetric. This is usually impossible when the problem itself is not symmetric, 
i.e. when the players have different roles. For example, one could envision a problem where some of the 
players get as input elements of some group and the rest of the players get as input integers. However, note 
that for such problems, symmetrization can still be useful in a somewhat-generalized version. For example, 
suppose there ai^e two sets of players: in set P, the players get group elements, and in set P', the players 
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get integers. Assume each of the sets contains exactly k/2 players. To use symmetrization, we would try 
to find a hard distribution that is symmetric inside of P and also inside of P'; namely, a distribution where 
permuting the players inside P has no effect on the distribution, and similarly for permuting the players 
inside P' . Then, to use symmetrization we can have Alice simulate two random players, pi and pj, where 
Pi is from P and pj is from P'; Bob will simulate all the rest of the players. Now symmetrization can be 
applied. If, alternatively, the set P contained just 3 players and P' contained A; — 3 players, we can have 
Alice simulate one of the players in P' , Bob can simulate the rest of the players in P' , and either Alice or 
Bob can play the three players from P. As can be seen, with a suitable choice of distribution, it should still 
be possible to apply symmetrization to problems that exhibit some amount of symmetry. 

The main topic for future work seems to be to find more setting and problems where symmetrization can 
prove useful. Since discovering this approach we have found repeatedly that problems that we encounter in 
other settings seem susceptible to this approach, and we think it has the potential to be a widely useful tool. 

7.1 Are Our Results Trivial? 

The problems we deal with in this paper are simple and natural, possibly the simplest problems one could 
ask in the multiplayer communication model. We have not found results about them in the literature, but it 
makes sense to ask if the lower bounds we prove have trivial proofs: maybe the symmetrization framework 
that we introduce is not needed, and some of these results can be proved trivially? The analogues of these 
results in the 2-player case are indeed trivial to prove, so why should the /o -player case be any different? 

As with any mathematical result, it is difficult to show that our results do not have trivial proofs hiding 
somewhere. One argument is that we have tried hard to find such proofs, and failed. A good way to get 
convinced that the fc-player setting is significantly different than the 2-player setting is to attempt to argue 
about /c -player protocols. These protocols have much more freedom than 2-player protocols, and it is harder 
to argue about what they cannot do. 

Perhaps the best evidence that at least some of our results are non-tiivial can be found in Section 13.1.11 
In this section we discussed the coordinate-wise AND problem in the coordinator model, and have shown 
that some natural candidates for hard distributions are not hard at all, and that proving a lower bound for the 
problem seems to require ruling out a certain Slepian- Wolf-related type of protocols (see more details there). 
This gives some indication that a lower bound for this problem should be at least somewhat non-tiivial. 

7.2 List of Problems 

Symmetrization is a technique that can be applied to a wide range of /c-player communication problems, as 
well as having many possible applications in related models. Any problem which has a symmetric hard- 
looking distribution is a good candidate. Below is a list of candidate problems, some of which we lower- 
bound in this paper, and some that we leave for future work. This list is far from exhaustive. 

• Coordinate-wise problems: Each player gets a vector of length n. Some symmetric coordinate-wise 
function 5f:{0,l}'^— J-jOjljis applied, resulting in a length n vector. Then a "combining function" 
h : {0, 1}" — >• Z is applied to the bits of the result. Z could be any output domain. In the examples 
we dealt with in this paper, g was AND, OR, XOR or MAJ, and h was typically the identity function, 
/c -player disjointness is the case when g is AND and h is OR. We believe that for most choices of g 
and h that are non-constant symmetric functions, the complexity of this problem is Q{nk). It might 
be interesting to conjecture this holds for any such h and g. 

• Equality: Each player gets a vector of length n, and the goal is to decide whether all players have 
received the same vector. Another variant is to decide whether all players have received distinct 
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vectors. This problem should have a lower bound of Q{nk) in the coordinator model, with a proof 
using symmetrization; we defer the details to a later version. 

• Graph problems: Each player gets a set of m edges in an n- vertex graph, and the goal is to compute 
some graph property of the graph that consists of all of the edges in the input. In this paper we treat 
this problem when the graph property is connectivity. Obviously, many other properties could be 
interesting. 

• Pointer Chasing: Each player has a set of directed edges in an n-vertex graph, where for each vertex 
V, only one player has an edge whose source is v. The "sink" of the instance is the vertex obtained by 
starting at vertex number 1 and walking according to the directed edges until reaching a vertex with 
no outgoing edges. We assume there is always a sink, and the goal is to find it. 
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A Omitted Proof for 2-BITS 

Lemma III] (restated). E[L>J/^ (2-6/75)] = n{np\og{l/p)). 

Proof. Here we will make use of several simple tools from information theory. Given a random variable 
X drawn from a distribution /i, we can measure the amount of randomness in X by its entropy H{X) = 
— ^^ /i(x) log2 /i(x). The conditional entropy H{X \ Y) = H{XY) — H(Y) describes the amount of 
entropy in X, given that Y exists. The mutual information I{X : Y) = H{X) + H(Y) — H{XY) measures 
the randomness in both random variables X and Y . 

Let V be any valid communication protocol. Let X be Alice's (random) input vector. Let Y be Bob's 
output as the Alice's vector he learns after the communication. Let 11 be the transcript of V. Let e = 1/3 be 
the eiTor bound allows by Bob. First, since after running V, with probability at least 1 — e, we have Y = X, 
thus Bob knows X. Therefore 

I{X : y I n) < eH{X). 

Consequently, eH{X) > I{X ■Y\Ii) 

= H{x I n) + H{Y I n) - H{XY I n) 

= H{XU) - H(U) + H{YU) - H{U) - {H{XYU) - H{U)) 

= H{Xn) + H(YU) - H{XYU) - H{U) 

= I{XU : YU) - H{U) 

> (1 - e)H{XU) - H{n) 

> il-e)HiX)-HiU) 

Therefore, E[|n|] > H{U) > (1 - 2e)H{X) > n{7iH{p)) > n{nplog{l/p)). D 

B Omitted Proofs for the Biased 2-party Set Disjointness 

Lemma IX2l (restated). When p has \x Dyl = 1 with probability 1/t then E dJ {2-DISJ) = il(n). 

The proof is based on ll30l . Before giving the proof, we first introduce some notations and a key technical 
lemma. Define 

A = {{x,y) : {p{x,y)>0)A{xny = $)} 

and 

B = {{x,y) : ip{x,y)>0)/\{xny^fl>)}. 

Thus p{A) = 1 — l/t and p{B) = 1/t. We need the following key lemma. The proof will be given shortly. 
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Lemma B.l. Let A, B, /x be defined as above. Let R = C x D be any rectangle in the communication 
protocol. Then we have ii{B n R) > l/40t • fi{A n R) - 2-o°i'^. 

Proof, (for Lemma l2!2l) Let TZ = {Ri, . . . ,Rt} be the minimal set of disjoint rectangles in which the 
protocol outputs "1", i.e, x n y = 0. Imagine that we have a binary decision tree built on top of these 
rectangles. If we can show that there exists O C TZ such that /^(Ur go ^*) — 0-5 • /i(|Jj:j g^i?i) and 
each of Rj G O lies on a depth at least 0.005n in the binary decision tree, then we are done. Since 
^(Uij go ^i) — ^-^ ' ^(Uk G7^ ^i) — 0-5 • {fJ-{A) — 1/lOOt) = il(l) and querying inputs in each rectangle 
in O costs r2(n) bits. 

We prove this by contradiction. Suppose that there exists C C 7^ such that ^(Ur eO' ^«) ^ ^-^ ' 
^(Uk g7^ ^*) ^^'^ ^^'^'^ °^ -^« ^ ^' ^^^^ °^ ^ depth less than O.OOSn in the binary decision tree. We have the 
following two facts. 

1. There are at most 2°°°^" disjoint rectangles that he on depths less than 0.005n, i.e., \0'\ < 2°°°^". 

2. 11 {[JR,eO'(^i ^ ^)) > 0-5 - 1/lOOt. 

Combining the two facts with Lemma [BTT] we reach the following contradiction of our error bound. 

lil\J{R,nB)\ >isi [J {RiHB 

> Y^ (l/40t • ti{Ri nA)- 2-0-01") 

i?iGO' 

>l/40t • (0.5 - 1/lOOt) - 2°-°°^" • 2-°-°i'" 

>l/100t D 

Proof for Lemma IB.li The distribution fi can be interpreted as follows: We first choose a random 
partition T = {To,Ti,{i}) of [n], such that To,Ti,{i} ai'e disjoint and |ro| = \Ti\ = 2/ - 1. Next, with 
probability l/\/t, we choose Alice's input x to be {i} union a random subset of Tq with size (Z — 1); and 
with probability 1 — l/\/t, we choose x to be a random subset of Tq with size I. Similarly, with probability 
l/Vt, we choose Bob's input y to be {i} union a random subset of Ti with size (/ — 1); and with probability 
1 — l/\/t, we choose y to be a random subset of Ti with size /. 

Given a partition T = (Tq, Ti, {i}), a rectangle R = C x D, and Alice and Bob's inputs x and y, we 
define 

p{T) = Pr[x eC\T] q{T) = Pr[y G C | T] 
po(T) = Pr[x G C7 I T,i x] q^iT) = Pr[y g C | r,i y] 
pi{T) = Pr[x G C I r,i G j;] qi{T) = Pr[y G C | r,i G y] 

We have the following observations. 

Observation B.l. a. p{T) = (l-l/^/t)■pQ{T) + l/^ft■pl{T). Similarly, q{T) = {I - I / ^/t) ■ qQ{T) + 
l/Vt-qi{T). 

b. IfTi is fixed, then qo and p are fixed. Similarly, I/Tq is fixed, then pQ and q are fixed. 

c. Bt[po{T)\Ti] = Bt[p{T)\Ti]. 

We define T is x-bad if pi{T) < l/3-po{T) -2-'^-^^'',andT isy-badif qi{T) < 1/3- go(r) - 2-°-0^". 
We say T is bad if it is either x-bad or y-bad. Let xo j Xi ^^d x be the indicate variables for the events that 
T is x-bad, y-bad and bad, respectively. We have following lemmas. 
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Lemma B.2. |i30J. Pr[xo{T) \ Ti] < 1/5, andPr[xi{T) \ Tq] < 1/5. 

Lemma B.3. 

E[po(T)Qo(r)(l - x{T))] > (l - Y^0^) Ebo(T)(Zo(T)] 

Proof. By symmetry we only need to prove 

E[po(T)Qo(r)xo(r)] < Ly^r) E[po(r)go(r)] 



We prove for each fixed Ti. Recall Observation IB .li b that when Ti is fixed, qo and p also fixed. 

B[po{T)qo{T)xo{T) \ Ti] 

^ '?o-- ^E[xo(T)|ri] (ObservationUJa) 

1 — 1/vc 



1-1/^ 



^/^ goEb(T) I Ti] 



1 - 1/Vt 
1/5 



1 - 1/Vt 
1/5 

1-1/A/t 



goE[po(T) I Ti] (Observation imic) 

B[po{T)qo{T) I Ti] 



D 



Lemma B.4. fi{AnR) = (1 - 1/t) ■B[po{T)qo{T)], and fi{B n R) = 1/t ■E[pi{T)qi{T)]. 

Proof. 

n{BnR) = fi{B) ■ fi{R \ B) 

= 1/t ■ ^(Pr[r] • Pr[(2;, y) e R\T,i e x,i € y]) 

T 
= l/t-Et[p,{T)q,{T)] 

And 

fj,{AnR) = fi{A) ■ fj,{R \ A) 

= (1 - 1/t) ■ Y, (Pr(r) • Pr[(x, y)eR\T,{t^x,i^y)V{iex,t^y)V{i^x,ie y)]) 

T 

= (1 - 1/t) • E[po(T)go(T)] 

D 
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Proof, (for Lemma IbTT] ) 



^l{Br^R) = i/f •Ebi(r)gi(T)] 

> iA-Ebi(r)gi(T)(i-x(r))] 

> l/t ■ E[(l/3 • po{T) - 2-0-0i«)(l/3 • qoiT) - 2-^-^'^){l - xiT))] 

> l/t • 1/9 • (i - Y^Y/Vl) ■ ^[po(^)'^o(^)] - 2"°-°'" 



^/^ I ^ ,.^ /I r^ D\ o-O.Oln 



> l/40t • /x(yl nR)-2' 



-O.Oln 
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C Omitted Proofs Graph Connectivity 

We provide here a full proof for the probability of the event ^i , that both subset of the graph L and R are 
connected. 

Lemma \42\ (restated), ^i happens with probability at least 1 — l/2n when fc > 681nn + 1. 

Proof. First note that by our construction, both \L\ and \R\ are il(?i) with high probability. To locally 
simplify notation, we consider a graph {V,E) of n nodes where edges are drawn in {k — 1) > 68 Inn 
rounds, and each round n/4 disjoint edges are added to the graph. If {V, E) is connected with probability 
(1 — l/4n), then by union bound over U^^2^j C\ ^L and U^^g^j fl ^R^ Ci is true with probability (1 — l/2n). 
The proof follows four steps. 

(SI): All points have degree at least Sinn. Since for each of k rounds each point's degree increases by 1 
with probability 1/2, then the expected degree of each point is 14 log n after the first 28 In n rounds. 
A Chernoff-Hoeffding bound says that the probability that a point has degree less than 8 In n is at 
most2exp(-2(61nn)2/(141nn)) < 2exp(-51nn) < 2/n^ < l/12n^. Then by the union bound, 
this holds for none of the n points with probability at least 1 — l/12n. 

(S2): Conditioned on (SI), any subset S C V ofh < n/10 points is connected to at least min{/i In n, n/10} 
distinct points in V \ S. At least 9n/10 points are outside of S, so each point in S expects to be 
connected at least (9/10)8 Inn > 71nn times to a point outside of 5. Each of these edges occur 
in different rounds, so they are independent. Thus we can apply a Chernoff-Hoeffding bound to 
say the probability that the number of edges outside of 5 for any point is less than 3 In n is at most 
2exp(— 2(41nn)^/(81nn)) = 2exp(— 41nn) = 2/n^. Thus the probability that no point in S has 
fewer than Inn edges outside S is (since h < n/10) at most l/5n^. 

If the /i • 3 In n edges outside of S (for all h points) are drawn independently at random, then we need 
to bound the probability that these go to more than n/10 distinct points or h In n distinct points. Since 
the edges are drawn to favor going to distinct points in each round, it is sufficient to analyze the case 
where all of the edges are independent, which can only increase the chance they collide. In either case 
hlnn < n/10 or hlnn > n/10 each time an edge is chosen (until ?i/10 vertices have been reached, in 
which case we can stop), 9/10 of all possible vertices are outside the set of edges already connected 
to. So if we select the 3/i In n edges one at a time, each event connects to a distinct points with 
probability at least 9/10, so we expect at least (9/10)(3/iln n) > 2/ilnn distinct points. Again by a 
Chernoff-Hoeffding bound, the probability that fewer than hlnn distinct points have been reached is 
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atmost2exp(-2(/ilnn)V(3/in)) < 2exp(-(2/3)/ilnn) < 2exp(-51nn) < 2/rv' < l/hv? (for 
h > 8). Together the probability of these events not happening is at most l/2n^. 

(S3): There is a single connected component So of size at least n/W. Start with any single point, we know 
from Step 1, its degree is at least 8 In n. Then we can consider the set S formed by these hi = 8lnn 
points, and apply Step 2 to find another hi Inn = 81n^ n = /12 points; add these points to S. The 
process iterates and at each round hi = 8 In* n, by growing only from hi the newly added points. So, 
by round i = lnn the set S = Sq has grown to at least size n/10. Taking the union bound over these 
Inn rounds shows that this process fails with probability at most l/12n. 

(S4): All points in V\Sc are connected to Sq- Each round each point p is connected to Sq with probability 
at least 1/20. So by coupon collector's bound, using the last 40 Inn rounds all points are connected 
after 2 In n sets of 20 rounds with probability at least 1 — l/12n. 

By union bound, the probabiUty of steps (S 1), (S3), and (S4) are successful is at least 1 — l/4n, proving our 
claim. 

D 
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